Goto

Collaborating Authors

 covariate-powered empirical bayes estimation


Covariate-Powered Empirical Bayes Estimation

Neural Information Processing Systems

We study methods for simultaneous analysis of many noisy experiments in the presence of rich covariate information. The goal of the analyst is to optimally estimate the true effect underlying each experiment. Both the noisy experimental results and the auxiliary covariates are useful for this purpose, but neither data source on its own captures all the information available to the analyst. In this paper, we propose a flexible plug-in empirical Bayes estimator that synthesizes both sources of information and may leverage any black-box predictive model. We show that our approach is within a constant factor of minimax for a simple data-generating model. Furthermore, we establish robust convergence guarantees for our method that hold under considerable generality, and exhibit promising empirical performance on both real and simulated data.


Reviews: Covariate-Powered Empirical Bayes Estimation

Neural Information Processing Systems

The Authors provide a simple but powerful approach to empirical bayesian inference under rather broad assumptions. The method is relevant in settings where both a) standard statistical estimators (such as the average) can be evaluated and b) covariates can be used to train machine learning models to estimate the same value. The paper tries to solve this problem in the setting where the standard estimator is not reliable enough (e.g. because sample size is too small) and the covariates only give weak information on the target variable. The problem setting considered is highly relevant in many real-world settings. Considering the practical relevance and theoretical interest in empirical bayes methods, it seems quite surprising that this approach has not been investigated earlier (only for special cases such as linear models).


Reviews: Covariate-Powered Empirical Bayes Estimation

Neural Information Processing Systems

This theory paper provides a number of novel results, including theoretical analysis of minimax bounds and an empirical analysis, for combinations of relatively simple statistical estimators and machine learning models of covariate information. The paper shows that these combinations improve on both the simple estimator alone and the machine learning model alone. The main concern raised by the reviewers is that the paper provides limited empirical validation. I disagree with this assessment, as the paper should be seen as a machine learning theory paper. As the proposed framework includes a number of advanced machine learning models, including XGBoost it should be very relevant for the NeurIPS community.


Covariate-Powered Empirical Bayes Estimation

Neural Information Processing Systems

We study methods for simultaneous analysis of many noisy experiments in the presence of rich covariate information. The goal of the analyst is to optimally estimate the true effect underlying each experiment. Both the noisy experimental results and the auxiliary covariates are useful for this purpose, but neither data source on its own captures all the information available to the analyst. In this paper, we propose a flexible plug-in empirical Bayes estimator that synthesizes both sources of information and may leverage any black-box predictive model. We show that our approach is within a constant factor of minimax for a simple data-generating model.


Covariate-Powered Empirical Bayes Estimation

Neural Information Processing Systems

We study methods for simultaneous analysis of many noisy experiments in the presence of rich covariate information. The goal of the analyst is to optimally estimate the true effect underlying each experiment. Both the noisy experimental results and the auxiliary covariates are useful for this purpose, but neither data source on its own captures all the information available to the analyst. In this paper, we propose a flexible plug-in empirical Bayes estimator that synthesizes both sources of information and may leverage any black-box predictive model. We show that our approach is within a constant factor of minimax for a simple data-generating model.